CDS
Accession Number | TCMCG075C11718 |
gbkey | CDS |
Protein Id | XP_007040873.1 |
Location | complement(join(36099412..36100461,36101101..36101244,36102276..36102353,36102543..36102659,36102762..36102818,36103004..36103135,36103214..36103411,36104282..36104392,36104512..36104727,36104822..36105007,36105737..36106303)) |
Gene | LOC18606917 |
GeneID | 18606917 |
Organism | Theobroma cacao |
Protein
Length | 951aa |
Molecule type | protein |
Topology | linear |
Data_file_division | PLN |
dblink | BioProject:PRJNA341501 |
db_source | XM_007040811.2 |
Definition | PREDICTED: AP-4 complex subunit epsilon [Theobroma cacao] |
EGGNOG-MAPPER Annotation
COG_category | U |
Description | AP-4 complex subunit |
KEGG_TC | - |
KEGG_Module | - |
KEGG_Reaction | - |
KEGG_rclass | - |
BRITE |
ko00000
[VIEW IN KEGG] ko00001 [VIEW IN KEGG] ko04131 [VIEW IN KEGG] |
KEGG_ko |
ko:K12400
[VIEW IN KEGG] |
EC | - |
KEGG_Pathway |
ko04142
[VIEW IN KEGG] map04142 [VIEW IN KEGG] |
GOs |
GO:0005575
[VIEW IN EMBL-EBI] GO:0005622 [VIEW IN EMBL-EBI] GO:0005623 [VIEW IN EMBL-EBI] GO:0005737 [VIEW IN EMBL-EBI] GO:0005829 [VIEW IN EMBL-EBI] GO:0005911 [VIEW IN EMBL-EBI] GO:0009506 [VIEW IN EMBL-EBI] GO:0016020 [VIEW IN EMBL-EBI] GO:0030054 [VIEW IN EMBL-EBI] GO:0030117 [VIEW IN EMBL-EBI] GO:0030119 [VIEW IN EMBL-EBI] GO:0030124 [VIEW IN EMBL-EBI] GO:0032991 [VIEW IN EMBL-EBI] GO:0044424 [VIEW IN EMBL-EBI] GO:0044425 [VIEW IN EMBL-EBI] GO:0044444 [VIEW IN EMBL-EBI] GO:0044464 [VIEW IN EMBL-EBI] GO:0048475 [VIEW IN EMBL-EBI] GO:0055044 [VIEW IN EMBL-EBI] GO:0098796 [VIEW IN EMBL-EBI] |
Sequence
CDS: ATGGGCTCCCAAGGCGGATTTTATCAGTCCAAAGAGTTTCTGGATCTGGTGAAGTCCATCGGCGAGGCTCGATCCAAGGCTGAAGAAGACCGGATTGTTCTCAACGAGATCGAGACTCTCAAACGCCGCATCTCGGAGCCCGATATCCCCAAGCGCAAGATGAAAGAGTACATCATCCGATTGGTTTACGTTGAGATGCTCGGTCACGATGCTTCCTTCGGTTACATTCACGCCGTTAAAATGACTCACGATGATAGTCTTCTCGTCAAGAGAACCGGTTACTTGGCCGTCACACTGTTTTTAAATGAAGATCACGATTTGATCATTTTGATTGTCAATACCATACAGAAAGATTTGAAGTCTGACAATTACTTGGTGGTTTGCGCCGCCTTGAATGCCGTTTGTAAGTTGATCAATGAGGAGACAATTCCTGCGGTGTTGCCGCAGGTTGTGGAGTTGCTGGGGCATCCTAAGGAGGCTGTGCGGAAGAAGGCCATCATGGCTCTCCATCGCTTTTATCAGAAATCCCCCTCTTCTGTCTCGCATCTTGTCTCCAATTTTCGCAAGAGGCTTTGTGATAATGATCCTGGAGTCATGGGTGCAACCCTTTGTCCGCTTTTTGATCTTATAACAATAGATGTTAATTCTTACAAAGATTTGGTTGTCAGCTTTGTAAGCATTCTTAAACAAGTAGCTGAACGCAGACTACCAAAGGCATATGATTACCATCAAATGCCAGCTCCATTTATTCAGATCAAATTGCTGAAAATTCTGGCTTTGCTTGGAAGTGGTGACAAGCAAGCAAGTGAAAACATGTACACTGTAGTGGGAGACTTATTCAGGAAGTGCGATTCGTCAAGTAATATAGGAAATGCCGTCCTTTATGAGTGCATATGCTGTGTCTCCTCTATATATCCCAATGCCAAGTTATTAGAGTCTGCAGCTGATGTTATATCGAGATTTTTGAAGAGTGACAGTCATAACCTAAAATACATGGGCATTGATGCTCTTGGCCGATTGATAAAGATAAGTCCAGATATTGCCGAGCAACACCAACTGGCTGTGATTGATTGCTTAGAGGACCCAGATGACACTCTGAAGAGAAAAACCTTTGAACTGCTGTATAAGATGACCAAGTCTACAAATGTGGAGGTTATTGTTGATCGCATGATTGATTACATGATTAGCATTAATGACAATCATTATAAAACTGAAATAGCATCTCGATGTGTTGAACTTGCGGAGCAATTTGCGCCAAGCAATCAGTGGTTCATCCAGACCATGAATAAAGTTTTTGAGCATGCGGGAGATCTGGTGAATATTAAGGTAGCACACAATTTGATGCGGTTGATTGCTGAGGGATTTGGAGAGGATGATGATTCTGCAGACAGTCAACTGAGATCATCTGCTGTGGAGTCATACTTGCGCATTCTTGGTGAACCTAAGTTGCCATCTGTTTTTCTTCAAGTAATTTGTTGGGTGTTGGGGGAGTATGGAACTGCTGATGGGAAGTTCTCTGCTTCCTACATTACTGGGAAGCTATGTGATGTGGCGGAGGCATATTCTAATGATGAGACTGTTAAGGCATATGCAGTTACAGCTCTCATGAAAATATATGCATTTGAAATAGCAGCACGGAGGAAAGTAGATCTGCTGCCTGAGTGTCAATCTTTAATGGAAGAATTATTGGCTTCTCACTCAACAGATTTGCAGCAACGTGCCTATGAACTGCAAGCAGTGATTGGCCTTGATGCTCATGCTGTTGAGTGTATTATGCCGTCAGATGCAAGTTGTGAAGATATTGAGGTTGATAAAGGCCTTTCATTCCTTAATGGTTATGTTGAAGAGTCGATAGAAAAAGGTGCTCAGCCCTATATTCCTGAGAGTGAACGCTCTGGAATGCTAAATATTAGCAATTTTAGGAACCAAGATCACCACGAAGCTTCATCACATGGTCTCAGGTTTGAGGCATATGAGCTTCCAAAGCCTACAGTGCAATCTAGGATTCCTCCAGCATCACTTGCTTCAACCGAACTTGTTCCAGTACCTGAGCCAACGTATCTTAGGGAGAGCTACCAGACTCCTTCTGTGACATCTGTATCATCAGATGCAGGATCCTCAGAGCTCAAGCTTCGACTAGATGGAGTCCAAAAGAAGTGGGGTAAACCAACATACGCTCCTGCAACGTCTACCTCAAATTCCACAGCACAGAAAACAGTTAATGGGGTCACACAAGTTGAGGGGGCAAGTTCTACAAATTCAAGAACGCGTGAAACCTATGATTCAAGGAAACCACAGGTTGAAATATCTCCAGAAAAGCAGAAGCTTGCTGCTTCGCTGTTTGGAGGTTCATCAAAAACGGAAAAGAGGCCAGCTACTGGTCATAAGACTTCAAAGGCAAGCACCCACATGGTGGAGAAGTCTCATGTGCCAAAGTCTAGCATGGAAGTTGCATCGGAAAAGACAGCTCCTGTTCAACCACCTCCGGACTTGCTTGATCTGGGGGAACCAACTGTCACAAGTATTGCCCCTTTCGTAGATCCATTTAAACAATTGGAGGGCCTTCTTGACCCAACTCAAGTTGGTTCAGCTGCTGCTACCAAATCACCTGATATTATGGCATTGTATGTAGACACACCTGCTGGCATACACAATAAAGATGACGGTGATCTTTTATCTGGCTTGTCAAATCCTTCAGTGACAAACATGCCTGGCGGTACCACAACCACACAACAAGAGCAACGAAGTAAGGGTCCCAACCCTAAAGATTCCTTGGAAAAGGATGCACTGGTTAGGCAGATGGGTGTGAACCCATCGAGTCAGAATCCAAACTTGTTTAGAGATCTACTTGGCTGA |
Protein: MGSQGGFYQSKEFLDLVKSIGEARSKAEEDRIVLNEIETLKRRISEPDIPKRKMKEYIIRLVYVEMLGHDASFGYIHAVKMTHDDSLLVKRTGYLAVTLFLNEDHDLIILIVNTIQKDLKSDNYLVVCAALNAVCKLINEETIPAVLPQVVELLGHPKEAVRKKAIMALHRFYQKSPSSVSHLVSNFRKRLCDNDPGVMGATLCPLFDLITIDVNSYKDLVVSFVSILKQVAERRLPKAYDYHQMPAPFIQIKLLKILALLGSGDKQASENMYTVVGDLFRKCDSSSNIGNAVLYECICCVSSIYPNAKLLESAADVISRFLKSDSHNLKYMGIDALGRLIKISPDIAEQHQLAVIDCLEDPDDTLKRKTFELLYKMTKSTNVEVIVDRMIDYMISINDNHYKTEIASRCVELAEQFAPSNQWFIQTMNKVFEHAGDLVNIKVAHNLMRLIAEGFGEDDDSADSQLRSSAVESYLRILGEPKLPSVFLQVICWVLGEYGTADGKFSASYITGKLCDVAEAYSNDETVKAYAVTALMKIYAFEIAARRKVDLLPECQSLMEELLASHSTDLQQRAYELQAVIGLDAHAVECIMPSDASCEDIEVDKGLSFLNGYVEESIEKGAQPYIPESERSGMLNISNFRNQDHHEASSHGLRFEAYELPKPTVQSRIPPASLASTELVPVPEPTYLRESYQTPSVTSVSSDAGSSELKLRLDGVQKKWGKPTYAPATSTSNSTAQKTVNGVTQVEGASSTNSRTRETYDSRKPQVEISPEKQKLAASLFGGSSKTEKRPATGHKTSKASTHMVEKSHVPKSSMEVASEKTAPVQPPPDLLDLGEPTVTSIAPFVDPFKQLEGLLDPTQVGSAAATKSPDIMALYVDTPAGIHNKDDGDLLSGLSNPSVTNMPGGTTTTQQEQRSKGPNPKDSLEKDALVRQMGVNPSSQNPNLFRDLLG |